Unsupervised training with directed manual transcription for recognising Mandarin broadcast audio

Authors

Kai Yu

Mark J. F. Gales

Philip C. Woodland

Abstract

The performance of unsupervised discriminative training has been found to be highly dependent on the accuracy of the initial automatic transcription. This paper examines a strategy in which a relatively small amount of poorly recognised data is manually transcribed to supplement the automatically transcribed data. Experiments were carried out on a Mandarin broadcast transcription task using both Broadcast News (BN) and Broadcast Conversation (BC) data. A range of experimental conditions is compared for both maximum likelihood and discriminative training using directed manual transcription. For BC data, fully unsupervised discriminative training obtains only 17% of the reduction in character error rate (CER) achieved by supervised training. Automatically selecting 18% of the data for manual transcription yields 50% of the CER gain from supervised training. The directed approach to selecting data outperforms the use of a random set of data for manual transcription.
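The selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract only says that poorly recognised data are selected, so ranking by a per-utterance recogniser confidence score, measuring the budget in audio duration, and all names used here (`Utterance`, `select_for_manual_transcription`, `fraction_of_supervised_gain`) are assumptions made for illustration. The second helper expresses the abstract's "percentage of the CER gain from supervised training" as a ratio of CER reductions.

```python
from typing import List, NamedTuple


class Utterance(NamedTuple):
    """Hypothetical record for one automatically transcribed utterance."""
    utt_id: str        # illustrative identifier
    duration: float    # seconds of audio
    confidence: float  # assumed recogniser confidence in [0, 1]


def select_for_manual_transcription(
    utts: List[Utterance], budget_fraction: float
) -> List[Utterance]:
    """Pick the lowest-confidence utterances within an audio budget.

    Assumption: "poorly recognised" is approximated by low confidence,
    and the budget is a fraction of the total audio duration.
    """
    if not 0.0 <= budget_fraction <= 1.0:
        raise ValueError("budget_fraction must lie in [0, 1]")
    budget = budget_fraction * sum(u.duration for u in utts)
    selected: List[Utterance] = []
    used = 0.0
    # Visit utterances from least to most confident.
    for utt in sorted(utts, key=lambda u: u.confidence):
        if used + utt.duration > budget:
            continue  # this utterance would exceed the budget, try shorter ones
        selected.append(utt)
        used += utt.duration
    return selected


def fraction_of_supervised_gain(
    cer_baseline: float, cer_system: float, cer_supervised: float
) -> float:
    """Share of the supervised CER reduction that a system recovers."""
    full_gain = cer_baseline - cer_supervised
    if full_gain <= 0:
        raise ValueError("supervised training must improve on the baseline")
    return (cer_baseline - cer_system) / full_gain
```

With a budget fraction of 0.18, the first function would return the least confident utterances covering at most 18% of the audio, which would then be sent for manual transcription while the remainder keeps its automatic transcription. A random-selection baseline, as compared in the abstract, would replace the confidence sort with a shuffle.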

Related resources

Unsupervised Training with Directed Manual Transcription for Recognising Mandarin

Unsupervised language model adaptation for Mandarin broadcast conversation transcription

This paper investigates unsupervised language model adaptation on a new task of Mandarin broadcast conversation transcription. It was found that N-gram adaptation yields 1.1% absolute character error rate gain and continuous space language model adaptation done with PLSA and LDA brings 1.3% absolute gain. Moreover, using broadcast news language model alone trained on large data under-performs a...

Unsupervised training and directed manual transcription for LVCSR

A significant cost in obtaining acoustic training data is the generation of accurate transcriptions. When no transcription is available, unsupervised training techniques must be used. Furthermore, the use of discriminative training has become a standard feature of state-of-the-art large vocabulary continuous speech recognition (LVCSR) systems. In unsupervised training, unlabelled data are recogni...

Speech retrieval of Mandarin broadcast news via mobile devices

This paper presents a system for speech retrieval of Mandarin broadcast news. First, several data-driven and unsupervised approaches are integrated into the broadcast news transcription system to improve the speech recognition accuracy and efficiency. Then, a multi-scale indexing paradigm for broadcast news retrieval is proposed to make use of the special structural properties of the Chinese la...

Dynamic language modeling for broadcast news

This paper describes some recent experiments on unsupervised language model adaptation for transcription of broadcast news data. In previous work, a framework for automatically selecting adaptation data using information retrieval techniques was proposed. This work extends the method and presents experimental results with unsupervised language model adaptation. Three primary aspects are conside...

Publication date: 2007
